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STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH OR DEVELOPMENT 
The present invention was supported in part by grant number ROl CA72023 from the 
National Institutes of Health (NIH/NCI). The U.S. Government has certain rights in the 
5 invention. 

FffiLD OF THE INVENTION 
The present invention relates generally to medical imaging. More particularly, the present 
invention relates to methods for differentiating normal and abnormal anatomical shapes 
10 through a feature estimation method and a classifier. 

BACKGROUND 

Colon cancer is the second leading cause of cancer deaths in the United States. American 
adults have 1/20 chance of developing and 1/40 chance of dying from this disease. There 

15 are approximately 150,000 new cases diagnosed each year resulting in 56,000 deaths (See 
e.g. PJ Wingo (1995) in a paper entitled "Cancer Statistics" and published in Ca Cancer 
Journal Clin. 45:8-30). Previous research has shown that adenomatous polyps, particularly 
those larger than 1 cm in diameter, are the most likely precursor to subsequent colorectal 
carcinoma (See e.g. a paper by RF Thoeni et al. (1994) entitled ''Polyps and cancer" and 

20 published in ''Textbook of Gastrointestinal Radiology, Philadelphia: W.B. Saunders, 
pi 160). The National Polyp Study clearly illustrated that colonoscopic removal of all 
identifiable polyps resulted in a decline in mortality rate between 76% and 90% compared 
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to historical controls (See e.g. a paper by SJ Winawer et. al. (1993) entitled ^^Prevention of 
colorectal cancer by colonoscopic polypectomy" and published in N. Engl. J. Med. 
329:1977-1981). Unfortunately, colon cancer is most often discovered after the patient 
develops symptoms, and by then, the likelihood of a cure has diminished substantially. 

5 

Fiberoptic colonoscopy (FOC) is considered the definitive diagnostic test (See e.g. a paper 
by DM Eddy (1990) entitied Screening for colorectal cancer" and published in Ann. 
Intern. Med., 113 373-384) for the presence of colonic polyps as it affords direct 
visualization and the opportunity for biopsy or removal of suspicious lesions (See e.g. the 

10 referenced paper supra by Winawer et. al.). However, FOC is not feasible as a population 
screening test due to cost, the small but real risk of complications such as perforation, and 
due to the fact that there are not sufficient endoscopists in the countiy to accommodate all 
patients. Moreover, tiie majority of colonoscopic examinations performed in tiie United 
States are negative for polyps or masses, therefore, a less invasive, more widely available 

15 procedure that is also acceptable to patients is atti-active. 

Computed tomography colonography (CTC) (also referred to as virtual colonoscopy) is a 
recently proposed non-invasive technique that combines spiral CT data acquisition of the 
air-filled and cleansed colon with 3D imaging software to create virtual endoscopic images 
20 of the colonic surface. The initial clinical results are quite promising, yet the technique is 
still impractical due, in part, to the time required to review hundreds of images per patient 
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study. This limitation begs for a computer-aided detection (CAD) method to help the 
radiologist detect polyps efficientiy from the acquired CTC data. 

Identifying colonic polyps using CAD is challenging because they come in various sizes 
5 and shapes, and because thickened folds and retained stool may mimic their shape and 
density. FIG. 1 demonstrates the appearance of polyps and healthy tissue as they appear in 
a virtual colonoscopy study. 

Initial studies describing CAD for polyp detection have focused on shape analysis and 
10 started from the intuitive observation on the similarity of the polyp shape to hemispheres. 
Summers et al. characterized the colon wall by computing its minimum, maximum, mean 
and Gaussian curvatures (See a paper by RM Sunmiers et al. (2000) entitled ''Polypoid 
lesions of airways: early experience with computer-assisted detection by using virtual 
bronchoscopy and surface curvature" and published in Radiology 208(2):331-337), 
15 Following discrimination of polypoid shapes by their principal minimum and maximum 
curvatures, more restrictive criteria such as sphericity measures are applied in order to 
eliminate non-spherical shapes. In Yoshida et al. (H. Yoshida et al. (2000) in a paper 
entitled '"Detection of colonic polyps in CT colonography based on geometric features'' and 
published in Radiology 217(SS):582-582) use topological shape of vicinity of each voxel, 
20 in addition with a measure for the shape curvedness to distinguish polyps from healthy 
tissue. Gokturk and Tomasi designed a method based on the observation that the bounding 
surfaces of polyps are usually not exact spheres, but are often complex surfaces composed 
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of small, approximately spherical patches (See SB Gokturk & C Tomasi (2000j in a paper 
entitled "A graph method for the conservative detection of polyps in the colon"" and 
published at the 2nd International Symposium on Virtual Colonoscopy November 2000 
Boston, USA). In this method, a sphere is fit locally to the isodensity surface passing 
5 through every CT voxel in the wall region. Groups of voxels that originate densely 
populated nearby sphere centers are considered as polyp candidates. Obviously, the 
clusters of the sphere centers are denser when the underlying shape is a sphere or a 
hemisphere. 

10 Paik et al. introduced a method based on the concept that normals to the colon surface will 
intersect with neighboring normals depending on the local curvature features of the colon 
(DS Paik et al. (2000) in a paper entitled "Computer aided detection of polyps in CT 
colonography: Free response roc evaluation of performance " and published in Radiology 
217(SS)370 and DS Paik et al. (1999) in a paper entitled "Detection of polyps in CT 

15 colonography: A comparison of a computer aided detection algorithm to 3-D visualization 
methods'" and published in Radiological Society of North America 85th Scientific Sessions. 
Chicago, IL: Radiological Soc, N Amer, p. 428), This method is based the observation that 
polyps have 3-D shape features that change rapidly in many directions, so that normals to 
the surface tend to intersect in a concentrated area. By contrast, haustral folds change their 

20 shape rapidly when sampled transversely, resulting in convergence of normals, but change 
shape very slowly when sampled longitudinally. As a result, the method detects the polyps 
by giving the shapes a score based on the number of intersecting normal vectors. This 
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score is higher in hemispherical polyps compared with folds. 



While most of the current methods have demonstrated promising sensitivity (i.e. ability to 
detect positives), they can be considered more as polyp-candidate detectors than polyp 
detectors because of their large number of false positive detections. Some of the methods 
incorporate simple intuitions about the shapes of polyps and non-polyps, which leads to 
false positive detections. For instance it has been observed that polyps have spherical 
shapes and provided different measures of sphericity. However, polyps span a large variety 
of shapes, and fitting spheres alone is not an accurate measure. As mentioned supra, polyp 
recognition is a difficult problem, but so is the manual identification of discriminating 
criteria between polyps and healthy tissue. Therefore, if an automatic method merely 
results in identifying polyp candidates, then manual examination of a (potentially large) 
number of images corresponding to the CAD outputs is required to ensure proper polyp 
detection. Such an examination is costly, time consuming and inefficient. Accordingly, 
there is need for new detection methods to recognize polyps in medical images and 
differentiate these polyps from healthy tissue. 



SUMMARY OF THE INVENTION 

The present invention is a method for detecting and classifying a shape in a medical image. 
It is based on generating a plurality of 2-D sections through a 3-D volume in the medical 
image. In general, the present invention is a combination of two steps. The first step is 
feature estimation to generate shape signatures for candidate volumes containing candidate 
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shapes. The feature estimation method computes descriptors of objects or of their images. 
The second general step involves classification of these shape signatures for diagnosis. A 
classifier contains, builds and/or trains a database of descriptors for previously seen shapes, 
and then maps descriptors of novel images to categories corresponding to previously seen 
5 shapes or classes of shapes. 

The 2-D sections differ from each other in position, orientation or position and orientation. 
The main idea here is to develop statistics of information on these sections that represent 
many variations of the shape. In contrast to previous methods, the method of the present 
10 invention does not explicitly make any assumptions about the shape. By choosing 
reasonable attributes, it captures many characteristics of shapes accurately. Consequently, 
the approach is applicable to other 3D pattern recognition problem and also to the discovery 
of any new shapes. The generality is assured by including the new shape samples to the 
training set of the algorithm, and designing new planar attributes if necessary. 

15 

In one exemplary embodiment, the plurality of 2-D sections are a plurality of triples of 
mutually orthogonal planes, which could also be randomly selected. Feature estimation 
could be accomplished through the step of determining intensity features or statistics. In 
one example, feature estimation includes the steps of fitting hnes, one or more circles, one 
20 or more ellipses, one or more quadratic curves, one or more rectangles, or one or more 
parallel lines. In one aspect, shape signatures could be obtained via a histogram of a 
plurality of shape signatures. Shape signatures could also be obtained via vector 
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quantization. The classifier could be a support vector machines classifier, which could be 
optimized with training data. 

The present invention could be utilized as a post-processing method that would refine the 
5 results of a high sensitivity, low specificity shape detection method, resulting in an 
increasing specificity without sacrificing sensitivity. In this case, candidate shapes or 
volumes containing a candidate shape would be the input to the present method. The 
present method is, however, not limited to its application as a post-processing method since 
it could also be used as an independent method without pre-processing identification. 

10 

The advantage of the present invention is that it does not make any assumptions about the 
to-be-detected shape as is conmion in prior art methods. Another advantage is that if a pre- 
processing method is used, and method of present invention is used as a post-processing 
method, then the present method does not have to make assumptions of the pre-processing 
15 method. Yet another advantage is that it eliminates the guesswork for obtaining the 
distinguishing features. 

BRIEF DESCRIPTION OF THE FIGURES 
The objectives and advantages of the present invention will be understood by reading the 
20 following detailed description in conjunction with the drawings, in which: 

FIG. 1 shows examples of polyps (a-c) and of healthy tissue (d-e) that have similar 
shapes; 
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FIGS. 2-3 show a general overview of the method according to the present invention; 
FIG. 4 shows an overview of the ROSS method according to the present invention; 
FIG. 5 shows an example of three cross sections of a body through the anatomical 

directions (a) axial, (b) coronal, (c) sagittal, shows an example of two 3-D 
5 renderings of a polyp with different camera views (d), and shows random 

oriented, mutually orthogonal triple planes through this polyp (e-g), all 

according to the present invention; 
FIG. 6 shows an example of optimum segmentation window in various images 

according to the present invention; 
10 FIG. 7 shows an example of volume renderings of different shapes (top row) and a 

random cross section through these shapes (bottom row) according to the 

present invention. These images are specific examples where fitting a circle 

(a), an ellipse (b), parallel lines (c), and a line (d) produces output about the 3- 

D shape; 

15 FIG. 8 shows examples according to the present invention of the primitive shapes: (a) 
a randomly oriented plane, (b) the edges in the image, (c)-(e) circle, quadratic, 
and line fit to the edges shown in (b). (f) Another randomly oriented plane 
with nearly parallel lines (g) parallel lines fit to the edges of the plane in (f); 
FIG. 9 shows an example of high dimensional histogram bins in 2-D according to the 

20 present invention: (a) Data in two dimensions; (b) Uniform histogram bins; (c) 

Bins found when clustering is applied. The storage gain (number of bins) is 
fivefold in this particular example; and 
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FIG. 10 shows an example of two polyps (left), a very similarly shaped fold structure 
(right), and their signatures (center) according to the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
Although the following detailed description contains many specifics for the purposes of 
illustration, anyone of ordinary skill in the art will readily appreciate that many variations 
and alterations to the following exemplary details are within the scope of the invention. 
Accordingly, the following preferred embodiment of the invention is set forth without any 
loss of generality to, and without imposing limitations upon, the claimed invention. 

In general, the present invention is a combination of two method steps as shown in FIGS. 
2-3. The first step is feature estimation 210 to generate shape signatures for candidate 
volumes containing candidate shapes. The feature estimation method computes descriptors 
of objects or of their images. In one example, the feature estimation is based on a Random 
Orthogonal Shape Sections method (ROSS) 310. The second general step involves 
classification 220, 320 of these shape signatures for diagnosis. A classifier contains, builds 
and/or trains a database of descriptors (330) for previously seen shapes, and then maps 
descriptors of novel images to categories corresponding to previously seen shapes or 
classes of shapes. In one example the classifier is a support vector machines classifier. 

The present method could be utilized as a post-processing method that would refine the 
results of a high sensitivity, low specificity candidate shape detection method, resulting in 
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an increasing specificity without sacrificing sensitivity. In this case, candidate shapes or 
volumes containing a candidate shape would be the input to the present method. The 
present method is, however, not limited to its application as a post-processing method since 
it could also be used as an independent method without pre-processing identification. 

The images are typically digital or computerized images such as, for instance, but not 
limited to, a CT, an MRI, a digitized X-ray, or any other image application that could be 
converted or rendered to a digital image. The images could be 2-D images useful to 
construct a 3-D image or a 3-D volumetric image. A shape that is targeted to be identified 
or diagnosed is, for instance, but not limited to, a shape that contains pre-cancerous tissue 
or cancerous tissue. These shapes are of interest to radiologists and are for instance polyps 
(e.g. colonic polyps), nodules, (e.g. liver and lung nodules), lesions, or the like. In case of 
colonic polyp, these shapes are differentiated from e.g. non-polyp shapes, tissue or volumes 
included in the normal colon, thickened haustral folds, and foci of retained stool. Even 
though the description focuses on CT images of the colon to diagnose colonic polyps, the 
present method could easily be applied in any type of application where it is necessary to 
characterize a shape and diagnose whether this shape corresponds to a structure of interest. 
The following describes in more detail the two general method steps. 

1. Random Orthogonal Shape Sections Method (ROSS) 

FIG. 4 gives an overview of the ROSS method. Here, the object is to mimic the ways 
radiologists view CT images of the colon, i.e., by for instance viewing images parallel to 
the axial, sagittal, and coronal planes (FIG. 5). While viewing these three planes gives 
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substantial information about the 3D shape, the information is incomplete. Therefore, and 
generally, in the approach of the present invention, we first obtain a large number of 
different 2-D sections (also referred to as planes) through a volume that includes the shape 
of interest. These 2-D sections could differ from each other in their orientation and/or 

5 position with the intent to cover as many different directions and/or positions through the 
volume and therewith the shape. In one embodiment, these 2-D sections are randomly 
selected. In a specific embodiment, these 2-D sections are a large number of random triples 
of mutually orthogonal sections through a suspicious subvolume. In one example, about a 
hundred random triples of mutually perpendicular images could be extracted from a 

10 candidate subvolume. Although this number could vary, one would realize that even 
though a reduction of triples would reduce the time required for processing, however this 
would be a trade-off with accuracy. If a less than adequate number of views is used, the 
triples could be biased toward some specific orientation in space. To identify an adequate 
number for a particular application, an easy attempt is to test the reproducibility of the 

15 ROSS method. That is the shapes should have the same signatures regardless of the 
random views used in the analysis. In a reproducibility test, one could also consider testing 
synthetic shapes such as cyUnders, ellipsoids and spheres. 

The main idea is that the statistics of information collected over a sufficiently large number 

20 of 2-D sections or triples gives a diagnostic and invariant signature of the shape. Because 

other structures may co-occupy the resulting images, the next step is to segment them to 

isolate the putative polyp for further processing. We then calculate a set of geometric 

attributes from each random plane, and generate a histogram of these geometric attributes 
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obtained from each triple as a feature- vector to represent the shape. Accumulating 
histograms of these attributes over several random triples makes the resulting signatures 
invariant to rotations and translations. Careful choice of geometric attributes results in 
robust signatures. Details of the image segmentation and image-based geometric attributes 
are given in section 1.1 and section 1.2 respectively. This histogram is described in section 
1.3. 

1.1. Image Segmentation 

In each image we first construct a square critical window located in the center of the image, 
surrounding the suspicious structure and excluding as much of the extraneous tissue as 
possible. The size of the critical window varies depending on the size and the shape of the 
suspicious structure in the image. A window is considered good when it contains a shape 
that is approximately circular and has a small circular radius. Because elongated folds are 
common in the colon, it was found to be useful to also explicitly rule out elongated regions 
when there is a circular region in the image. To fmd a critical window's size, an image is 
first binarized by intensity thresholding, and the following target function (Equation 1) is 
computed for each window of size w centered in the middle of the image: 

f(w) = fliKw) + (w) - (hCi^iw) (1) 

Here, r(w) is the radius of the circle that best fits the edges in the binarized image, CcircieM 
is the residual to the best fitting circle, eiJw) is the residue to the best fitting line all in the 
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subwindow w, and ai and 02 are constants. Details of the shape fitting are described in the 
next section, w could be varied from, for instance, 8 mm to 24 nmi and the value of w that 
yields the smallest /fwj is chosen as the critical window size to calculate the geometric 
attributes. FIG, 6 shows an example of the resulting critical window for several different 

5 images with different suspicious lesions. In some cases, minimizing Equation 1 may result 
in a critical window that includes some portion of the surrounding colon wall. To further 
accommodate these cases, a Gaussian weighting function, location at the image center, 
could be used while fitting primitive shapes. By weighting with this Gaussian mask, we 
aim to give more importance to boundary points of a potential polypoid shape than to those 

10 of the surrounding colon wall. 

1.2. Image-Based Geometric Attributes and Features 

Once the optimal critical subwindow in a particular image/section is found, we can extract 
shape and intensity related geometric attributes (features) in the critical window. For this 

15 purpose, we can fit primitive shapes such as one or more lines, one or more circles, one or 
more quadratic curves, one or more rectangles, and one or more parallel sets of lines to the 
largest connected edge component, i.e., boundary, of the shape. Each primitive shape could 
be fit in a least squares manner. The boundary in each random plane could for instance be 
determined by a 2D Sobel derivative operation (See RC Gonzales et al. (1993) in a book 

20 entitled "Digital Image Processing'' and published by Addison-Wesley) followed by 
thresholding. The mathematical details of the primitive shape fittings are described in 
section 3. 
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A random slice of a sphere is a circle. Thus, fitting circles is a means of measuring the 
sphericity of the 3-D shape (FIG. 7(a)). When doing so, the residuals at each pixel on the 
boundary are first weighted with a Gaussian located at the image center as mentioned in the 
section supra. The Gaussian weighted residuals are then added together, and the least- 
square solution gives the optimum circle. The residual to the least-square solution is 
recorded as well. 

Similarly, the ellipsoidal nature of the shape can be measured by fitting a quadratic to the 
edges. This way, we can capture the similarity of the shape to a pedunculated polyp (FIG. 
7(b)). The quadratic invariants given in section 3, as well as the residual to the least-square 
solution are recorded. The cross section of a fold on the image plane may contain 
approximately parallel lines (FIG. 7(c)). To capture this structure, we apply parallel lines 
analysis, which includes fitting lines to the two largest connected components of the 
boundary points. We record the residual to these lines and the distance and angle between 
these approximately parallel lines. Similarly, the residual to the optimum fitting line gives 
information on the flat-ness of the surface (FIG. 7(d)). 

FIG. 8(a) and (b) give an example of a randomly oriented plane and the edges on this 
plane, respectively FIG. 8(c), (d) and (e) illustrate three primitive shapes, circle, quadratic 
and line being fit to this random plane. FIG. 8(f) shows another random plane going 
through a thick fold and FIG. 8(g) shows two approximately parallel lines being fit to its 
edges. 
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To extract information on higher order frequency characteristics of the boundary, 3rd order 
moment invariants are computed as well (See a paper by MK Hu (1962) entitled ''Visual 
pattern recognition by moment invariants'' and published in IRE transactions on 
5 information theory ITS: 179-187), This gives information on the curviness of the boundary 
points. 

In addition to all of these shape-based features, intensity features are extracted from the 
tissue part of the image. For this, the tissue part is first separated away from the 
10 background by intensity thresholding and, next, the intensity mean and standard deviation 
of the tissue are calculated and recorded. The intensity-based features are not 
discriminative features in polyp recognition, therefore, these features could be omitted 
when used in polyp recognition applications. These features, however, can be useful in 
another medical shape recognition problem. 

15 

All the attributes mentioned so far are calculated for each random triple of images. The 
three images in each triple are sorted in the order of increasing radius of curvature, and the 
features above are listed into a vector in this order. The resulting A^-vector containing the N 
attributes from the three planes represents the signature of the shape, relative to that 
20 particular triple of perpendicular planes. 

2.3. Obtaining the Histograms in High-Dimensional Space 
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The following step in the method of the present invention is to obtain histograms of feature 
vector distributions over the randomly oriented 2-D sections or triples. Given enough 2-D 
sections or triples, these histograms are essentially invariant to orientation and position. An 
efficient solution, proposed here, represents a histogram by first computing the 

5 representatives for the main clusters of features over a large collection of vectors. New 
feature vectors are then assigned to these clusters, rather than to fixed uniform bins. This 
method is called vector quantization (For a reference see a book by A Gersho & RM Gray 
(1992) entitled "Vector Quantization and Signal Compression" and pubUshed by Kluwer 
Academic Press). FIG. 9 shows an example in 2-D space. Note that, storage efficiency 

10 increases exponentially as the number of dimensions increase beyond two. 

In the implementation of the method of the present invention one could suppose that Xy is 
the iV-vector obtamed from the / random triple of perpendicular planes extracted from the 
i* shape in a training set. One could then obtain X^Js from all of the shapes, and then 

15 invoke the ^means algorithm (for a reference see A Gersho & RM Gray (1992) in a book 
entitled ''Vector Quantization and Signal Compression'' and published by Kluwer Academic 
Press) to compute vector clusters. The cluster centers are first initialized to a random 
selection from Xt/s. Subsequent iterations of the ^means algorithm then alternately 
reassign vectors to clusters and re-compute cluster centers, resulting eventually in the 

20 optimum cluster centers. 

Once the representative histogram bin centers are determined, a histogram of feature 
vectors belonging to each particular shape is calculated. When forming these feature 
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histograms, the simplest strategy would be to have each vector increment a counter for the 
nearest cluster center. This method, however, is overly sensitive to the particular choice of 
clusters. In the present method a more robust solution is adopted in which each feature 
vector partitions a unit vote into fractions that are inversely proportional to the vector's 
5 distances to all histogram bin centers. The histograms thus obtained, one per candidate 
volume, are the rotation and translation invariant shape signatures used for classification as 
described in the next section. FIG. 10 shows examples of signatures obtained from three 
different shapes. 

10 2. Classifier 

Given the representative vectors for polyps and healthy tissue, the optimum classifier is 
computed using training data. Subsequently the optimum classifier is used on test data. In 
the particular example of CTC, a training set is defined as a collection of candidate volumes 
that are classified by fiberoptic colonoscopy (FOC) and CT colonoscopy to be polyps or 
15 non-polyps. 

In general, a support vector machines (SVM) classifier aims to find the optimal 
differentiating hypersurface between the two classes in the training set (See e.g. V Vapnik 
(1998) in a book entitled "Statistical Learning Theory, New York). The optimal 
20 hypersurface is the one that not only correctly classifies the data, but also maximizes the 
margin of the closest data points to the hypersurface. 
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Mathematically, we consider the problem of separating the training set S of points 
X, 6 /?" with i=l,2,....,N. Each data point x, belongs to either class and, thus, is given a 
label £ {-1,1} • SVM implicitly transforms the given feature vector x, into new vectors 
(^(x) m a space with more dimensions, such that the hypersurface that separates the x, 
becomes a hyperplane in the space of (^(x)s. Thus the linearly non-separable data in the 
original space becomes linearly separable in a higher dimension. Finding the hypersurface 
is a well-studied optimization problem that maximizes the distance of the hypersurface to 
the margin points, so called support vectors (See e.g. V Vapnik (1998) in a book entitled 
-Statistical Learning Theory, New York; B Scholkopf (1997) in a book entitled ^Support 
Vector Learning- and published by Munich, Germany; R Oldenbourg Verlag; K 
Veropoulos et al. (1999) in a paper entitled "Controlling the sensitivity of support vector 
machines" and presented at the Int. Joint Conf. AI (IJCAr99), Stockholm, Sweden). 
Support vectors are essentially the elements that carry the differentiating characteristics 
between polyps and nonpolyps. In the classification of a test vector x, only the support 
vector needs to be considered, and the following expression needs to be calculated: 

d{x)= ^a,yiK(x,,x) + b (2) 

where the constants a, h are computed by the classifier-learning algorithm, and the x, are 
the computed support vectors (SVs). K is called the Kernel fimction, and is used to map the 
data from its original dimension to higher dimensions so that the data is linearly separable 
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in 



the mapped dimension (For reference see B Scholkopf (1997) in a book entitled ''Support 
Vector Learning' and published by Munich, Germany; R Oldenbourg Verlag). Given the 
expression d(x), the decision is given based on the sign oid(x): 



J polyp if dix)>0 

decision = ] , M^\^r\ 

[non - polyp if d(,x) < U 



In other words, in the classification process of a test case, it is checked which side of the 
hypersurface the element resides. If it is in the polyp side, it is decided that the new case is 
a polyp, and otherwise, it is a non-polyp. It could also be given a score to each element by 
10 looking at its distance from the optimal hypersurface. 

In clear contrast with previous classification methods, such as linear discriminant analysis 
(LDA), or nearest neighbor methods that minimize the empirical risk (risk of 
misclassification in the training set), SVMs minimize the structural risk, given as the 
15 probability of misclassifying previously unseen data, by utilizing the optimal hyperplane 
between two classes of data. In addition, SVMs pack all the relevant information in the 
training set into a small number of support vectors and use only these vectors to classify 
new data. This way, the distinguishing features of the two classes are implicitly identified 
while the detracting less important points are discarded. In other classification methods 
20 such as LDA or neural networks, the classification surfaces are obtained by taking all of the 

data points into account, which might statistically deteriorate the performance of these 

methods. 
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3. Primitive Shape Fitting 

Let be a collection of points in the image boundary with image coordinates (Xi, y,) with / = 
\, ...N. In this section, examples of mathematical details are described to fit primitive 
5 shapes such as circle, quadric curve, and line to the collection of points Pi. 

A circle is defined by its three parameters: The coordinates of its center (xo, yo) and its 
radius r. The points Pi satisfy the circle equation 



10 



[2x,,2);,,l] 



yo 

2 2 2 

-xo-yo. 



(4) 



In order to find the three unknowns (xo, yo, r), equations coming from each point are stacked 
and a linear least-square, solution is obtained. Let A and b be the following matrix and 



vector: 



15 



A = 



2x, 2y^ 1 
2jc2 2y^ 1 



2x^ 2yN 1_ 



B = 



+ yi 

2 2 

x2 + y2 
4 + yl 



(5) 
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then [Xf^yor^ - xl - ylf = (A^A)"' A^feis the least-square solution to the best fitting circle. 



2 T 

Similarly, the residue to the least-square solution is = Aix^yy -xl-y^] -b. 



A quadratic is defined by the following equation: 



[x^yi,2Xiy^,,2xiay,i] 



= [-1] 



(6) 



Similar to the circle equations, the matrix A and the vector b are built by stacking the left 
and right sides of the equation, thus, the least-square solution is given by 

10 [abed ef ={A'^Ay'A^b. The least-square residue is given by 



^quadratic 



T 

,,=A[a b c d e] -b. 



The quadratic equation can also be written in the following form: 



15 



= 0 



(7) 



where Tis given by: 
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T = 



a c d 
c b e 
d e 1 



(8) 



The singular values of the matrix are invariant to rotations of the quadratic shape and, thus, 
5 are recorded as quadric invariants. 

The equation of a line is defined by a point on the line and its direction. Given a collection 

of points {Xi, y^) for z = 1. N, the best fitting line (in the least-square sense) goes through 

the central point {xo, yo)- The line itself can then be written as: 



























yy. 



(9) 



where v = [v^ + Vy]^is the direction of the line. Let {x^,y,)he the normalized point 
coordinates of Pu 

15 



r ^ 
X, ^ 


_ X, 






Ji- 


Ji- 




_3^o. 



(10) 



V is then given as the first singular vector of M, where Mis tiie following matrix: 
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M = 



X2 

lyJi 



(11) 



Given v and {xo, yo) the best fitting line equation is complete. For any given pomt Pi, the 
5 error to this line is given as: 







X. 












V - 


X. 




e^line — 




Ji. 


Ji. 





(12) 



4. Conclusion 

10 The present invention has now been described in accordance with several exemplary 
embodiments, which are intended to be illustrative in all aspects, rather than restrictive. 
Thus, the present invention is capable of many variations in detailed implementation, which 
may be derived from the description contained herein by a person of ordinary skill in the 
art. For example, in one aspect the choice of kernel function in the support vector machines 

15 classification is a parameter that could directly affect the results. One could choose kernel 
function such as, but not limited to, radial basis functions, exponential radial basis 
functions, polynomial functions, linear kernel, and histogram-based kernels. In another 
aspect to expand the differentiation of different 3-D shapes one could use different 
geometrical properties that are determined from each triple of planes or a combination of 2- 

20 D sections. In yet another aspect, related to the training process, the resulting support 
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vectors could give valuable feedback for designing new features for the classifying system. 
More specifically, the clusters of close by support vectors from the two classes belong to 
similar shapes in polyps and nonpolyps. These closeby support vectors will help observe 
the distinguishing features of similar polyps and nonpolyps. In still another variation, the 
5 classifier could also be a hidden markov model based classifier, a neural network based 
classifier, a fuzzy-based classifier, a genetic algorithm based classifier, or any other 
classifier capable of classifying the shape signature information. 

Note that while the present invention has been described in the context of a method, those 
10 skilled in the art will appreciate that the method of the present invention is capable of being 
distributed in the form of a computer readable medium of instructions in a variety of forms, 
and that the present invention applies equally regardless of the particular type of signal 
bearing medium used to actually carry out the distribution. In other words, the present 
invention is also a program storage device accessible by a computer, tangible embodying a 
15 program of instructions or means executable by the computer to perform method steps for 
detection and classification of a shape in a medical image as described supra. Examples of 
computer readable media include recordable type media such as disks and CD-ROMS and 
transmission type media such as digital and analog communication links. In addition, the 
present invention could be implemented and coded in different programming languages 
20 and/or packages. All such variations and other variations are considered to be within the 
scope and spirit of the present invention as defined by the following claims and their legal 
equivalents. 
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